NSF PAR Search | NSF Public Access Repository

Essential barrier height and a probabilistic approach in characterizing potential landscape

https://doi.org/10.1016/j.spa.2025.104763

Li, Yao; Tao, Molei; Wang, Shirou (December 2025, Stochastic Processes and their Applications)

Free, publicly-accessible full text available December 1, 2026
Fast Non-Log-Concave Sampling under Nonconvex Equality and Inequality Constraints with Landing

Jeon, Kijung; Muehlebach, Michael; Tao, Molei (September 2025, NeurIPS)

Free, publicly-accessible full text available September 18, 2026
Non-equilibrium Annealed Adjoint Sampler

Choi, Jaemoo; Chen, Yongxin; Tao, Molei; Liu, Guan-Horng (September 2025, NeurIPS)

Free, publicly-accessible full text available September 18, 2026
Essential barrier height and a probabilistic approach in characterizing potential landscape

Li, Yao; Tao, Molei; Wang, Shirou (August 2025, Stochastic Processes and their Applications)

Free, publicly-accessible full text available August 22, 2026
A Closer Look at Model Collapse: From a Generalization-to-Memorization Perspective

Shi, Lianghe; Wu, Meng; Zhang, Huijie; Zhang, Zekai; Tao, Molei; Qu, Qing (September 2025, NeurIPS)

Free, publicly-accessible full text available September 18, 2026
MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control

Zhu, Yuchen; Guo, Wei; Choi, Jaemoo; Liu, Guan-Horng; Chen, Yongxin; Tao, Molei (September 2025, NeurIPS)

Free, publicly-accessible full text available September 18, 2026
Fast Solvers for Discrete Diffusion Models: Theory and Applications of High-Order Algorithms

Ren, Yinuo; Chen, Haoxuan; Zhu, Yuchen; Guo, Wei; Chen, Yongxin; Rotskoff, Grant M; Tao, Molei; Ying, Lexing (September 2025, NeurIPS)

Free, publicly-accessible full text available September 18, 2026
Variational Learning Finds Flatter Solutions at the Edge of Stability

Ghosh, Avrajit; Cong, Bai; Yokota, Rio; Ravishankar, Saiprasad; Wang, Rongrong; Tao, Molei; Khan, Mohammad Emtiyaz; Möllenhoff, Thomas (September 2025, NeurIPS)

Free, publicly-accessible full text available September 18, 2026
Robust First- and Second-Order Differentiation for Regularized Optimal Transport

https://doi.org/10.1137/24M1674030

Li, Xingjie; Lu, Fei; Tao, Molei; Ye, Felix X-F (June 2025, SIAM Journal on Scientific Computing)

Applications such as unbalanced and fully shuffled regression can be approached by optimizing regularized optimal transport (OT) distances, including the entropic OT and Sinkhorn distances. A common approach for this optimization is to use a first-order optimizer, which requires the gradient of the OT distance. For faster convergence, one might also resort to a second-order optimizer, which additionally requires the Hessian. The computations of these derivatives are crucial for efficient and accurate optimization. However, they present significant challenges in terms of memory consumption and numerical instability, especially for large datasets and small regularization strengths. We circumvent these issues by analytically computing the gradients for OT distances and the Hessian for the entropic OT distance, which was not previously used due to intricate tensorwise calculations and the complex dependency on parameters within the bi-level loss function. Through analytical derivation and spectral analysis, we identify and resolve the numerical instability caused by the singularity and ill-posedness of a key linear system. Consequently, we achieve scalable and stable computation of the Hessian, enabling the implementation of the stochastic gradient descent (SGD)-Newton methods. Tests on shuffled regression examples demonstrate that the second stage of the SGD-Newton method converges orders of magnitude faster than the gradient descent-only method while achieving significantly more accurate parameter estimations.
Free, publicly-accessible full text available June 30, 2026
Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces

Rojas, Kevin; Zhu, Yuchen; Zhu, Sichen; Ye, Felix X-F; Tao, Molei (July 2025, ICML)

Free, publicly-accessible full text available July 13, 2026